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Abstract 

The nearly neutral theory emphasizes the interaction of drift and weak selection in evolution. With progress of genome 
biology, the applicability of the nearly neutral theory has expanded. The genome-wide analyses of synonymous and 
nonsynonymous substitutions at protein-coding regions show prevalence of very weak selection. Many patterns of evolution 
of gene regulation are also in agreement with the nearly neutral prediction. Our consideration on near-neutrality expands in 
relation to the progress on molecular understanding of robustness and epigenetics. Both are bridges to link genotypes with 
phenotypes and important for understanding how weak selection and drift interact in the evolution of complex systems. 
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Introduction 

The nearly neutral theory was put forward in 1973 as an ex- 
tension of the neutral theory (Ohta 1973). Simply put, the 
nearly neutral theory posits that there exists a significant class 
of alleles that are sufficiently weakly selected such that the 
evolutionary dynamics are governed by both drift and selec- 
tion. At the beginning, the theory was directed toward amino 
acid substitution in evolution. There are now various data from 
amino acid substitutions to genomic structural variations re- 
lating to the nearly neutral theory. In the following, I present 
progress in the applicability of the theory; near-neutrality is 
widely observed at the protein level, and divergence pattern 
of gene expression is mostly consistent with near-neutrality. 

Another interesting development on near-neutrality is 
found in the concept of robustness of gene regulatory net- 
works. The nearly neutral theory has become more signifi- 
cant in understanding morphological evolution because 
gene regulation is thought to be the basis of organismal de- 
velopment, and nearly neutral mutations would increase un- 
der robust systems as compared with sensitive systems. 
Intimately, related subjects are epigenetic phenomena of 
which our understanding is rapidly increasing. Its relation 
to near-neutrality is also discussed. Note that both robust- 
ness and epigenetics lie between genotype and phenotype. 

Evolution of Protein-Coding Regions 

The nearly neutral theory is directed toward understanding 
the interaction of drift and selection. Figure 1 shows the 



comparison of the selection, the neutral and the nearly neu- 
tral theories on how new mutations are classified. An impor- 
tant prediction of the nearly neutral theory is the negative 
correlation between the evolutionary rate and the effective 
population size of species. This is because, in the nearly neu- 
tral class, slightly deleterious mutations are much more 
abundant than slightly advantageous ones (Ohta 1992). 
Also, if selection coefficients are dependent upon conditions 
determined by genetic backgrounds and environments, 
even slightly advantageous mutant substitutions may be- 
come rapid in small populations as compared with large 
populations (Ohta 1972). 

Comparison of the pattern of synonymous and nonsy- 
nonymous substitutions at the protein-coding regions gives 
opportunities for verifying the population size effect. Note 
that mammalian lineages are characterized as follows: or- 
ganisms with long generation time tend to have small pop- 
ulation size and vice versa. So population size tends to be 
large for rodents, middle for artiodactyls, and small for pri- 
mates. I analyzed the 49 gene sequences of three orders, 
primates, artiodactyls, and rodents (Ohta 1995). The results 
are in accord with the prediction, that is, the rodent branch 
is much longer than the primate branch for the synonymous 
tree, but the difference of the two lineages is not so large for 
nonsynonymous tree. This is because of the population size 
effect of nearly neutral amino acid substitutions in compar- 
ison with synonymous changes. Genome data show simi- 
lar pattern, for example, the ratio of nonsynonymous to 
synonymous divergence is higher in human-chimpanzee 
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Fig. 1. — Diagram to show how new mutations are classified under 
the selection, the neutral, and the nearly neutral theories. The diagram 
of the nearly neutral theory is made by folding that of the neutral 
theory. Therefore, the selected class includes both advantageous and 
deleterious mutations and the nearly neutral class, both slightly 
advantageous and slightly deleterious one. 



comparison than in mouse-rat comparison (The Chimpan- 
zee Sequencing and Analysis Consortium 2005). 

A second key prediction of the nearly neutral theory is 
abundance of low-frequency polymorphisms as compared 
with strict neutrality. Data on DNA polymorphisms reveal 
prevalence of nearly neutral amino acid substitutions. For 
example, Hughes et al. (2003) have analyzed human SNP 
data and found that amino acid changing SNPs show re- 
duced diversity as compared with synonymous SNPs in 
agreement with the nearly neutral prediction. 

The McDonald-Kreitman (M-K) test (McDonald and 
Kreitman 1991), that compares the relative numbers of syn- 
onymous and nonsynonymous substitutions within a popu- 
lation and those between closely related species, is popular 
for detecting selection. Several reports have been pub- 
lished on data analyses by using an extended M-K test. 
For example, Smith and Eyre-Walker (2002) analyzed data 
of Drosophila simulans and D. yakuba and emphasized that 
45% of amino acid substitutions have been fixed by positive 
selection. 

Sawyer et al. (2007) have performed a solid analysis of X 
chromosome data of Drosophila melanogaster and D. 
simulans, that is, polymorphisms in 91 genes in an African 
population of the former and the divergence between the 
two species. The result is interesting: about 70% of amino 
acid changing polymorphisms are slightly deleterious, but 
about 95% of fixed differences are positively selected. They 
also estimated that about 50% of amino acid changes are 
nearly neutral, if the near-neutrality is defined as |Nes| < 1 . 
They conclude that weak selection of which substantial frac- 
tion is nearly neutral is efficient for adaptive evolution of 
proteins. Therefore, adaptive protein evolution and near- 
neutrality are nicely unified. 

Functional approach on amino acid substitutions is be- 
coming to have impact on near-neutrality. Lunzer et al. 
(201 0) examined functional effects of single amino acid sub- 
stitutions in isopropylmalate dehydrogenase (IMDH) of 



Escherichia coli. They have found that the transition from 
E. coli IMDH to Pseudomonas aeruginosa IMDH cannot 
be explained without assuming slightly deleterious amino 
acid substitutions followed by compensatory substitutions. 
This nearly neutral process is thought to be caused by cryptic 
epistasis among amino acids within the protein molecule. 

By using genome-wide polymorphism data of closely re- 
lated species of yeast, Elyashiv et al. (201 0) have performed 
detailed statistical analyses to estimate the effect of purify- 
ing selection. The yeast species used are characterized by 
having several completely isolated subpopulations with 
varying sizes. They have found negative correlation between 
heterozygosity and the ratio of nonsynonymous to synony- 
mous polymorphisms on these subpopulations, whereas this 
ratio is larger for the within-population polymorphisms than 
on fixed differences among subpopulations. These results 
have verified the prevalence of weak purifying selection sup- 
porting the nearly neutral theory. They have further found 
that the shifts in intensity of selection across populations can 
be explained by the change of a single parameter, that is, 
population size. 

Evolution of Gene Regulation 

Evolution of gene regulation is under interplay of drift and 
selection. The best-studied case of gene regulation is the in- 
teraction of transcription factors and enhancers in the up- 
stream of the transcription start site. Gene regulation is 
thought to be in a well-balanced state, and any mutations 
that disturb the balance are deleterious. Ludwig et al. (2000) 
have found that the enhancer regions are in constant turn- 
over, that is, nucleotide substitutions at the enhancer region 
of the gene, even skipped, are occurring within the allowed 
latitude of stabilizing selection on this gene expression. In 
other words, the turnover is nearly neutral. 

By measuring the level of mRNA in various organs of hu- 
man and chimpanzee, Khaitovich et al. (2006) have clarified 
the relationship between diversity of mRNA levels among 
human individuals and the divergence of mRNA levels be- 
tween human and chimpanzee. They have found that the 
results for various organs have been mostly consistent with 
the nearly neutral prediction. The exception was testis such 
that the human-chimpanzee divergence is too rapid as com- 
pared with diversity among human individuals. The authors 
interpret that this rapid divergence reflects positive selection 
on gene expression for testis. 

By using quantitative genetics approach, Bedford and 
Hartl (2009) examined the observed variance of mRNA levels 
among seven Drosophila species. They have found that the 
variance initially increases with divergence time but eventu- 
ally saturates. Based on this observation, they estimated the 
intensity of stabilizing selection on expression levels and 
have found that the selection is weak, may be regarded 
as mostly nearly neutral. However, they emphasize that 
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divergence among the species is mostly attributed to posi- 
tive selection, and therefore, weak selection is effective. 

Phenomena That May Explain 
Abundance of Weak Selection 

In this section, let us consider why such a large fraction of 
mutations both at the protein level and at the level of gene 
regulation are subject to weak selection. 

Robustness of Genetic Regulatory Systems 

Although robustness of biological systems has been recog- 
nized since Waddington, molecular methods to tackle this 
subject have become available only recently. Our knowledge 
on this is now rapidly expanding and should be incorporated 
into discussions of evolution. Particularly, the concept seems 
to provide an important clue to the long-standing issue on 
how molecular changes and morphological evolution are 
linked. Also it is important to recognize that robustness in- 
creases the width of near-neutrality (Ohta 2002). 

Wagner (2005) argues that pathways of gene expression 
are complex, and numerous networks are entangled and 
that the system becomes robust to any perturbations. He 
calls this kind of robustness, "distributed robustness." An- 
other type of robustness is found by the morphological 
anomaly induced by mutations of the Hsp90 (Rutherford 
and Lindquist 1998). Hsp90 is a molecular chaperone and 
helps to stabilize signal transduction systems, and its defects 
cause phenotypic anomalies. Participation of microRNAs to 
the stability of various genetic systems is also suggested 
(Hornstein and Shomron 2006). So there are various mech- 
anisms to increase robustness. 

Following Lehner (2010), let us survey various types of 
robustness, that is, mutational (genetic), environmental, 
and stochastic ones. For the analysis, data on large-scale 
synthetic lethal or sick screens of yeast and other model or- 
ganisms are useful. By examining such data, Lehner (2010) 
has found that genetic robustness is often correlated with 
stochastic robustness and/or environmental robustness. In 
other words, when genetic perturbation by a mutation 
has minor effect but have large effects in combination with 
defects of other genes (i.e., synthetic lethal or sick and 
therefore robust), the gene shows resilience to environmen- 
tal stress (environmental robustness) and/or stochastic 
change (stochastic robustness). This negative interaction 
is equivalent to robustness, and mutations will have initially 
small (or nearly neutral) effects. Lehner (2010) argues that, 
under variable environment, resilience to many environmen- 
tal conditions is advantageous and selected for, and genetic 
robustness has evolved as a by-product of evolution of en- 
vironmental robustness 

Yeast data also provide some important characteristics of 
quantitative genetic interactions relating to robustness. Bel- 
trao et al. (2010) surveyed quantitative mapping of genetic 



interactions of yeast data on colony size of mutant strains. 
By using the multiplicative model, they classified the inter- 
action types into the following three, positive, neutral, and 
negative interactions. Positive interaction means that the fit- 
ness effect of the double mutants is less than that expected 
from the two single mutations. This type includes the sup- 
pressive and compensatory cases. The neutral interaction 
means that the two mutations are independent. Negative 
interaction means that effect of the double mutants is more 
than that expected from the two single mutations. This type 
includes robust cases. Beltrao et al. (201 0) have focused on 
proteins involved in chromatin biology, such as chromosome 
segregation, transcription, and chromatin modeling/remod- 
eling. They have found a module composed of three corre- 
lated protein clusters. Note that proteins form complex set 
consisting of functional modules, each of which is made of 
several clusters. Their significant finding is that, within each 
cluster, genetic interactions are mostly positive, whereas, 
among the clusters, they are mostly negative. Remember 
that the three clusters are involved in chromatin model- 
ing/remodeling, and proteins in each cluster often physically 
associate. It has often been emphasized that genetic inter- 
actions are rapidly changing in evolution as compared with 
proteins (Roguev et al. 2008; Tischler et al. 2008). In the 
chromatin modeling/remodeling system above, Beltrao 
et al. (2010) have further noticed that positive interactions 
are more conserved than negative interactions. By assuming 
that interactive effects parallel with fitness values, negative 
interactions result in more robust and more nearly neutral 
systems than positive interactions. Above findings based 
on genome-wide data have significant implications for 
our considerations on evolution of complex networks. 

Proteins are more conserved than genetic interaction sys- 
tems as mentioned before. However, note that the previous 
example of slightly deleterious plus compensatory amino 
acid substitutions in evolution of IDMH (Lunzer et al. 
2010) may be regarded as a robust system of amino acids 
that makes the protein. 

Epigenetics 

The meaning of epigenetics is broad and various processes 
involved in "the inheritance of variation above and beyond 
changes in the DNA sequence" are included (Bonasio et al. 
201 0). Here, let us define epigenetics as more than one phe- 
notypes under the same genotype that are heritable for 
some generations. Recent progress on our understanding 
of this concept at the molecular level is remarkable. Epige- 
netics is intimately related to robustness as both lie between 
genotypes and phenotypes, and their incorporation into 
evolutionary problems is a most urgent task. 

Among various processes involved in epigenetics, DNA 
methylation is best studied. It is thought that DNA methyl- 
ation is linked to chromatin structure and to participate in 
gene regulation. It has been known for some time that 
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the spectrum of genome methylation is very broad, that is, it 
is different among species and among loci within a genome 
and that its function is related to gene regulation. Following 
Suzuki and Bird (2008), let us briefly review progress on DNA 
methylation of higher organism genome. DNA methylation 
is found throughout the human genome except CpG is- 
lands. However, this global methylation pattern is limited 
to vertebrates, and mosaic patterns are observed in yeast 
and fruitfly. In plants, global pattern is seen, but the meth- 
ylation system of transposons is different from that of ani- 
mals. So there are similarities and differences among the 
species. 

The importance of methylation is its impact on gene reg- 
ulation. It has been known for some time that DNA meth- 
ylation works to silence gene expression. In the following, I 
present summary of Suzuki and Bird (2008). DNA methyla- 
tion pattern is different according to cell types and therefore 
may contribute to development. A notable progress on this 
topic is advancement of mapping of methylation of ge- 
nomes. Comparison of such methylation maps provides ba- 
sic data for considering the meaning of within-species 
(among individuals) and between-species differences of 
methylation patterns. Also active research is going on 
how DNA methylation map is different among various hu- 
man tissues. 

Next, I discuss an interesting idea on how DNA methyl- 
ation may contribute to evolution of complex systems. Var- 
iation of DNA methylation at regulatory regions among 
individuals of the same genotype increases variation of 
gene expression and hence variability of phenotypes. Fein- 
berg and Irizarry (2010) consider that the increase of var- 
iability is important for coping with environmental stresses 
and examined variation of methylation patterns in human 
and mouse tissues. They have identified the locations of 
variable regions of DNA methylation among individuals 
in mouse liver and termed them variably methylated re- 
gions (VMRs). They have extended the analysis to mouse 
brain, human liver, and human brain and found that VMRs 
are often associated with genes for development and mor- 
phogenesis. It has also been noted that the variation in 
methylation is likely to be caused by gain or loss of CpG 
in regions involved in gene regulation. Based on these find- 
ings, they argue that, if variable methylation at a regulatory 
region of a gene is advantageous under variable environ- 
ments, its genetic basis, that is, loss or gain of CpG dinu- 
cleotides, is selected for, and provides plasticity in the 
developmental program. Note that variation of gene ex- 
pression is intimately related to robustness because it is re- 
ciprocal of stochastic robustness (Kaneko 2007), and the 
latter is linked to genetic and environmental robustness 
(Lehner 2010). Both epigenetics and robustness are in- 
volved in processes that lie between genotype and pheno- 
type, and their effects on weak selection must be 
significant. 
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Fig. 2. — A scenario to show the meaning of near-neutrality in 
relation to epigenetics and robustness that connect phenotypes with 
genotypes. 

Nearly Neutral Zone 

Let us now consider the problem of drift and selection, that 
is, the nearly neutral zone of figure 1 in relation to robust- 
ness and epigenetics. Natural selection works on pheno- 
types, but its effects are transmitted by underlying 
genotypes, and the two are linked by epigenetics and ro- 
bustness of organismal development. Figure 2 gives a sce- 
nario. Among epigenetic changes, only those that have 
minute or favorable effects would survive. As pointed out 
before, it is important to notice that epigenetic processes 
provide variable phenotypes that may be useful in changing 
environments because regulatory systems would readily re- 
spond to environmental stresses. 

In relation to robustness, the human genome has extra- 
activity as shown by The ENCODE Project Consortium 
(2007), that is, transcriptional activity is more widespread 
in human genome than expected from protein-coding re- 
gions. Such activities are thought to be mainly nearly neu- 
tral, that is, they occur and may survive in allowed latitude 
under weak selection, drift, and robustness. Actual patterns 
on gene expression divergence analyzed by Bedford and 
Hartl (2009) and by Khaitovich et al. (2006), mentioned pre- 
viously, may fit to the present scenario. 

From the above scenario, it may be said that near-neutral- 
ity prevails at the genotype level but that natural selection 
may predominate at the phenotype level. Important points 
of this scenario are environmental contacts to gene expres- 
sion and availability of enough room for modifying gene 
networks. Note also that, under the scenario, some pro- 
posals on epigenetic effects in evolution and variation 
(Jablonka and Lamb 2002; West-Eberhard 2003; Slatkin 
2009) may become more understandable. As to "Chance 
and Necessity" by Jacques Monod, he might have consid- 
ered natural selection at the phenotype level because mo- 
lecular bases of epigenetics and robustness were not known 
40 years ago. Here, I would like to emphasize again that this 
scenario may have significant implications for understand- 
ing evolution of enormously complicated systems at various 
levels in biological world. 
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